FLUORESCENT PROTEINS

The present invention relates to novel variants of the fluorescent protein
GFP having improved fluorescence properties.


The use of Green Fluorescent Protein (GFP) derived from Aequorea
victoria has revolutionised research into many cellular and molecular-biological
processes. GFP allows researchers to label proteins within cells with an intrinsic
fluor, so obviating the requirement to perform chemical labelling of proteins, and
allowing development of assays of biological processes in intact living cells.




US 6491084 describes the use of GFP as a biological reporter. Early
applications of GFP as a biological reporter (Chalfie et al, Science, (1994), 263,
802-5; Chalfie, et al, Photochem.Photobiol., (1995), 62(4), 651-6) used wild type
(native) GFP (wtGFP), but these studies quickly demonstrated two areas of
deficiency of wtGFP as a reporter for use in mammalian cells. Firstly, the protein,
being derived from a poikilothermic marine organism, does not undergo protein
folding efficiently when expressed in mammalian cells cultured at 37 °C, resulting
in weak fluorescence. Secondly, the spectral characteristics of the wtGFP are
not ideally suited to use as a cellular reporter, requiring excitation with
electromagnetic radiation in the near-UV range, which is potentially damaging to
living cells.




Consequently, significant effort has been expended to produce variant
mutated forms of GFP with properties more suitable for use as an intracellular
reporter.




A number of mutated forms of GFP with altered spectral properties have
been described. A variant-GFP (Heim et al, (1994), Proc.Natl.Acad.Sci., 91,
12501) contains a Y66H mutation which blue-shifts the excitation and emission
spectrum of the protein. However, this protein is only weakly fluorescent and
requires potentially damaging UV excitation.

A further mutant of GFP (Heim et al, Nature, (1995), 373, 663-664)
contains a S65T mutation which red-shifts the optimum excitation and emission
wavelengths relative to wtGFP and which is 4-6 fold brighter than wtGFP when
expressed as a recombinant protein at 25 °C. However, this variant does not
yield bright fluorescence when expressed in hosts cultured at 37 °C.



Ehrig et al (FEBS Lett., (1995), 367, 163-6) describe two mutations of GFP,
T203I and E222G, which individually delete one of the excitation maxima of
wtGFP. The E222G mutation deletes the near-UV excitation peak at 395 nm and
produces a red-shift in the excitation peak at 475 nm to 481 nm. The emission
peak for this mutant protein is at 506 nm.



WO 96/27675 describes two variant GFPs, obtained by random
mutagenesis and subsequent selection for brightness, which contain the
mutations V163A and V163A+S175G, respectively. These variants were shown
to produce more efficient expression in plant cells relative to wtGFP and to
increase the thermotolerance of protein folding. The double mutant
V163A+S175G was observed to be brighter than the variant containing the single
V163A mutation alone; however, this mutant exhibits an undesirable blue-shifted
excitation peak.



A further mutant, termed cycle-3, generated by molecular evolution
through DNA shuffling (Crameri, A. et al, Nature Biotechnology, (1996), 14, 315-9)
is available commercially from Invitrogen Inc. Cycle-3-GFP contains three
mutations (F99S+M153T+V163A) which increase whole cell fluorescence
approximately 42 fold when compared with wtGFP. However, this mutant retains
the near-UV excitation maximum of the wtGFP, making it less suitable as a
reporter for use in living cells.

The above mutations effectively address some of the spectral deficiencies
of wtGFP as a biological reporter in providing variant forms of GFP which are
compatible with lower energy excitation and which emit at wavelengths
compatible with detection instrumentation commonly in use for measuring
biological reporters. However, such mutations do not address the problem of
inefficient folding and chromophore formation when wtGFP or spectral variants
are expressed in hosts requiring growth at temperatures significantly greater than
ambient.

US 6172188 describes variant GFPs wherein the amino acid in position 1
preceding the chromophore has been mutated to provide an increase of
fluorescence intensity. Such mutations include F64I, F64V, F64A, F64G and
F64L, with F64L being the preferred mutation. These mutants result in a
substantial increase in the intensity of fluorescence of GFP without shifting the
excitation and emission maxima. F64L-GFP has been shown to yield an
approximate 6-fold increase in fluorescence at 37 °C due to shorter chromophore
maturation time.

In addition to the single mutants or randomly derived combinations of
mutations described above, a variety of variant-GFPs have been created which
contain two or more mutations deliberately selected from those described above
and other mutations, and which seek to combine the advantageous properties of
the individual mutations to produce a protein with expression and spectral
properties which are suited to use as a sensitive biological reporter in mammalian
cells.

One mutant, commonly termed EGFP, available commercially from
Clontech Inc., contains the mutations F64L and S65T (Cormack, B.P. et al, Gene,
(1996), 173, 33-38). These mutations, when combined, confer an approximate
35-fold increase in brightness over wtGFP and the spectral characteristics permit
excitation and detection of EGFP with commonly used fluorescein excitation (488
nm) and emission filters (505 nm-530 nm). EGFP has been optimised for
expression in mammalian systems, having been constructed with preferred
mammalian codons.

US 6194548 (Osumi et al) discloses GFPs with improved fluorescence and
folding characteristics at 37 °C that contain, at least, the changes F64L and
V163A and S175G.




US 6077707 (Tsien et al) describes a blue fluorescent protein (BFP)
containing the F64L mutation in combination with Y66H and US 6194548 (Osumi
et al) describes a further BFP containing the F64L, Y66H, Y145F and L236R
substitutions.




The present invention provides novel engineered derivatives of green
fluorescent protein (GFP) which exhibit enhanced fluorescence relative to wtGFP
when expressed in non-homologous cells at temperatures above 30 °C, and
when excited at about 490 nm compared to the parent proteins, i.e. wtGFP.
Mutant GFPs produced using the method of the invention provide a means for
detecting GFP reporters in mammalian cells at lower levels of expression and/or
increased sensitivity relative to wtGFP. This greatly improves the usefulness of
fluorescent proteins in studying cellular functions in living cells. The
multiply-mutated GFPs of this invention have fluorescence properties which are not
predictable from the properties of the individual mutations when studied in
isolation. Furthermore, the inventors have surprisingly found that certain GFPs
according to the present invention, which do not contain any mutations in the
chromophore region relative to wtGFP, exhibit enhanced fluorescence compared
with mutant GFPs described previously.




In a first aspect of the invention, there is provided a fluorescent protein
which is derived from Green Fluorescent Protein (GFP) or any functional GFP
analogue and has an amino acid sequence which is modified by amino acid
substitution compared with the amino acid sequence of wild type Green
Fluorescent Protein, said modified fluorescent protein comprising:
i) an amino acid substitution at position F64;
ii) a single amino acid substitution at a position selected from the group
consisting of positions S65 and E222; and
iii) a single amino acid substitution at a position selected from the group
consisting of positions V163 and S175;
wherein said modified GFP has a different excitation spectrum and/or emission
spectrum compared with wild type GFP.

Suitably, the amino acid F at position 64 may be substituted by an amino
acid selected from the group consisting of L, I, V, A and G, thereby providing
F64L, F64I, F64V, F64A, or F64G substitutions. Preferably, the amino acid
substitution at position 64 is the F64L substitution.

In embodiments where the amino acid S at position 65 is substituted, it is
suitably substituted by an amino acid selected from the group consisting of G, A,
L, C, V, I and T, thereby providing S65G, S65A, S65L, S65C, S65V, S65I or
S65T substitutions. Preferably, the amino acid substitution at position 65 is the
S65T substitution.

In embodiments where the amino acid E at position 222 is substituted, it is
suitably substituted by an amino acid selected from the group consisting of G, A,
V, L, I, F, S, T, N and Q, thereby providing E222G, E222A, E222V, E222L, E222I,
E222F, E222S, E222T, E222N or E222Q substitutions. Preferably, the amino
acid substitution at position 222 is E222G.

In one preferred embodiment of the first aspect, the amino acid V is
substituted by A at position 163. In a second preferred embodiment of the first
aspect, the amino acid S is substituted by G at position 175.

Suitably, the novel fluorescent proteins exhibit high fluorescence in cells
expressing them when said cells are incubated at a temperature of 30 °C or
above, preferably at a temperature of from 32 °C to 39 °C, more preferably from
35 °C to 38 °C and most preferably at a temperature of about 37 °C.

Preferably, the fluorescent protein according to the first aspect has an
amino acid sequence which is modified by amino acid substitution compared with
the amino acid sequence of wild type Green Fluorescent Protein having the
sequence: SEQ ID No.2.

A preferred protein according to the present invention is a protein in which,
in relation to SEQ ID No.2 of GFP, the amino acid F at position 64 has been
substituted by L, the amino acid V at position 163 has been substituted by A and
the amino acid E at position 222 has been substituted by G, and is shown herein
as having the amino acid sequence as set forth in SEQ ID No.3.

An alternative preferred protein according to the present invention is a
protein in which, in relation to SEQ ID No.2 of GFP, the amino acid F at position
64 has been substituted by L, the amino acid S at position 175 has been
substituted by G and the amino acid E at position 222 has been substituted by G,
and is shown herein as having the amino acid sequence as set forth in SEQ ID
No.4.

A further preferred protein according to the present invention is a protein in
which, in relation to SEQ ID No.2 of GFP, the amino acid F at position 64 has
been substituted by L, the amino acid S at position 65 has been substituted by T
and the amino acid S at position 175 has been substituted by G, and is shown
herein as having the amino acid sequence as set forth in SEQ ID No.5.

A further preferred protein according to the present invention is a protein in
which, in relation to SEQ ID No.2 of GFP, the amino acid F at position 64 has
been substituted by L, the amino acid S at position 65 has been substituted by T
and the amino acid V at position 163 has been substituted by A, and is shown
herein as having the amino acid sequence as set forth in SEQ ID No.6.
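
The combinations listed above can be checked mechanically against clauses i) to iii) of the first aspect. The following Python sketch is illustrative only and is not part of the disclosure: the function names are hypothetical, and it assumes one reading of the clauses, namely one substitution at position 64, exactly one at position 65 or 222, and exactly one at position 163 or 175. It inspects positions only and ignores any other mutations present.

```python
import re

# Positions named in clauses i), ii) and iii) of the first aspect.
CLAUSE_I = {64}
CLAUSE_II = {65, 222}
CLAUSE_III = {163, 175}


def parse(label):
    """Split a label such as 'F64L' into (wild-type residue, position, new residue)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", label)
    if match is None:
        raise ValueError(f"unrecognised mutation label: {label}")
    return match.group(1), int(match.group(2)), match.group(3)


def matches_first_aspect(labels):
    """True if the labels hit each clause exactly once (assumed reading of the clauses)."""
    positions = [parse(label)[1] for label in labels]
    return all(
        sum(position in clause for position in positions) == 1
        for clause in (CLAUSE_I, CLAUSE_II, CLAUSE_III)
    )


# The four preferred proteins as described in the text.
PREFERRED = {
    "SEQ ID No.3": ["F64L", "V163A", "E222G"],
    "SEQ ID No.4": ["F64L", "S175G", "E222G"],
    "SEQ ID No.5": ["F64L", "S65T", "S175G"],
    "SEQ ID No.6": ["F64L", "S65T", "V163A"],
}

if __name__ == "__main__":
    for name, labels in PREFERRED.items():
        print(name, labels, matches_first_aspect(labels))
```

Under this reading, prior-art combinations discussed earlier, such as F64L+S65T or F99S+M153T+V163A, do not satisfy all three clauses.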

Suitably, the GFP or functional GFP-analogue used to derive the
fluorescent protein may be obtained from any convenient source, for example
native GFP derived from species of the genus Aequorea, suitably Aequorea
victoria. The chromophore in wtGFP from Aequorea victoria is at positions 65-67
of the predicted primary amino acid sequence (SEQ ID No.2). In a preferred
embodiment, the GFP is derived from Aequorea victoria.



The modified proteins of the present invention may be produced by
introducing mutations in a sequence of the nucleic acid that encodes the protein.
As used herein, a preferred sequence of the gene encoding wtGFP is derived
from Aequorea victoria, published by Chalfie et al, (Science, (1994), 263, 802-5)
disclosed as SEQ ID No.1 (Figure 1). The corresponding amino acid sequence
is shown in SEQ ID No.2 (Figure 2). Alternative sequences of the GFP gene
may be used, for example, the nucleotide (and predicted amino acid) sequences
of the GFP gene described by Prasher et al, (Gene (1992), 111, 229) and the
sequences as disclosed in WO 97/11094. In addition, alternative gene
sequences that encode the fluorescent protein may incorporate a consensus
Kozak nucleotide sequence (Kozak, M., Cell (1986), 44, 283), or preferred
mammalian codons, to provide improved translation in mammalian systems. The
nucleotide sequence corresponding to the fluorescent protein may also encode
useful restriction enzyme sites and additional elements such as target sites for
enzymes and purification tags. Methods for incorporation of a Kozak region,
preferred mammalian codons, restriction enzyme sites, enzyme sites and
purification tags are well known in the art and may result in the incorporation of
amino acid residues and a change in numbering of amino acid residues in the
fluorescent protein relative to the wtGFP numbering in the sequence provided.
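
The change in residue numbering mentioned above can be illustrated with a short Python sketch. It is a hedged example and not part of the disclosure: the helper name, the toy sequence and the six-residue tag are hypothetical. Only the residues F64 and the chromophore positions 65-67 (S, Y, G) follow the wtGFP numbering used in the text; every other residue is a placeholder.

```python
def apply_substitution(sequence, label, offset=0):
    """Apply a substitution such as 'F64L' given in wtGFP numbering.

    `offset` is the number of residues added upstream of the wtGFP
    sequence (for example by a tag), which shifts every position.
    """
    wild_type, position, replacement = label[0], int(label[1:-1]), label[-1]
    index = position - 1 + offset  # convert 1-based position to 0-based index
    if sequence[index] != wild_type:
        raise ValueError(
            f"expected {wild_type} at position {position}, found {sequence[index]}"
        )
    return sequence[:index] + replacement + sequence[index + 1:]


# Toy sequence (placeholder residues), with F at 64 and S, Y, G at 65-67.
toy = "G" * 63 + "F" + "SYG" + "A" * 3

# Hypothetical N-terminal tag of six residues shifts the numbering by six.
tag = "HHHHHH"
tagged = tag + toy

if __name__ == "__main__":
    print(apply_substitution(toy, "F64L")[60:67])
    print(apply_substitution(tagged, "F64L", offset=len(tag))[66:73])
```

Checking the wild-type residue before substituting catches a forgotten offset: applying "F64L" to the tagged sequence without the offset raises an error rather than silently changing the wrong residue.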

Herein, the abbreviations used for the amino acids are those stated in
J.Biol.Chem., (1968), 243, 3558.

In a second aspect of the invention, there is provided a fusion compound
comprising a protein of interest fused to a fluorescent protein which is derived
from Green Fluorescent Protein (GFP) or any functional GFP analogue and has
an amino acid sequence which is modified by amino acid substitution compared
with the amino acid sequence of wild type Green Fluorescent Protein, said
modified fluorescent protein comprising:
i) an amino acid substitution at position F64;
ii) a single amino acid substitution at a position selected from the group
consisting of positions S65 and E222; and
iii) a single amino acid substitution at a position selected from the group
consisting of positions V163 and S175;
wherein said modified GFP has a different excitation spectrum and/or emission
spectrum compared with wild type GFP.






In the context of the present invention, the term "protein of interest" is
intended also to encompass polypeptides and peptide fragments. Examples of
such proteins of interest may be selected from the group consisting of: NFκB and
subunits thereof, RAC1, PLC domains, MAPKAP2, PKC, Cytochrome C, RHO,
β-actin, STAT6, protein kinase C isotypes, LAMP1/2 TGN, ATP7A TGN and
GLUT4.




In a third aspect of the present invention there is provided a nucleic acid
molecule comprising a nucleotide sequence encoding a fluorescent protein which
is derived from Green Fluorescent Protein (GFP) or any functional GFP analogue
and has an amino acid sequence which is modified by amino acid substitution
compared with the amino acid sequence of wild type Green Fluorescent Protein,
said modified fluorescent protein comprising:
i) an amino acid substitution at position F64;
ii) a single amino acid substitution at a position selected from the group
consisting of positions S65 and E222; and
iii) a single amino acid substitution at a position selected from the group
consisting of positions V163 and S175;
wherein said modified GFP has a different excitation spectrum and/or emission
spectrum compared with wild type GFP.


Preferably, the nucleic acid molecule according to the third aspect
encodes a fluorescent protein having an amino acid sequence which is modified
by amino acid substitution compared with the amino acid sequence of wild type
Green Fluorescent Protein having the sequence: SEQ ID No.2.

In a particular embodiment of the third aspect, the nucleic acid molecule
comprises a nucleotide sequence encoding a fluorescent protein derived from
Green Fluorescent Protein (GFP) or any functional GFP analogue according to
the invention fused to a nucleotide sequence encoding a protein of interest.

Preferably, the nucleic acid molecule is a construct comprising a DNA
sequence.

Preferably, the nucleic acid molecule encodes a fluorescent protein having
an amino acid sequence selected from the group consisting of SEQ ID No.3,
SEQ ID No.4, SEQ ID No.5 and SEQ ID No.6.



As is well known, a single amino acid may be encoded by more than one
nucleotide codon and thus each of the above nucleotide sequences may be
modified to produce an alternative nucleotide sequence that encodes the same
peptide. Thus, the preferred embodiments of the invention include alternative
DNA sequences that encode the preferred proteins as previously described. It is
to be understood that the preferred proteins (and the nucleic acid sequences from
which they are derived), may include additional residues, particularly N- and
C-terminal amino acids, or 5'- or 3'-nucleotide sequences, and still be essentially as
described herein.
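
The degeneracy of the genetic code referred to above can be illustrated with a short Python sketch. It is an illustration only: the two nine-nucleotide strings are invented for the example and are not taken from SEQ ID No.1, and the function name is hypothetical. The codon table is the standard genetic code, built from the conventional T, C, A, G ordering.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence, reading whole codons from the first base."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Two different, invented nucleotide sequences for the tripeptide L-T-Y.
sequence_a = "CTGACCTAC"
sequence_b = "TTAACTTAT"

if __name__ == "__main__":
    print(translate(sequence_a), translate(sequence_b))
    print(sequence_a != sequence_b and translate(sequence_a) == translate(sequence_b))
```

The two sequences differ at the third base of every codon, and in the first codon at the first base as well, yet encode the same peptide, which is the basis on which alternative DNA sequences can encode the same preferred protein.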



Suitably, the DNA construct encoding the novel fluorescent proteins may
be prepared synthetically by established methods, e.g. the phosphoramidite
method described by Beaucage and Caruthers, (Tetrahedron Letters (1981), 22,
1859-1869), or the method described by Matthes et al., (EMBO Journal (1984), 3,
801-805). According to the phosphoramidite method, oligonucleotides are
synthesized, e.g. in an automatic DNA synthesizer, purified, annealed, ligated
and cloned into suitable vectors.

The DNA construct encoding the fluorescent protein may also be prepared
by recombinant DNA methodology, for example cDNA cloning. See for example,
Sambrook, J. et al (1989) Molecular Cloning - A Laboratory Manual, Cold Spring
Harbor Laboratory Press.



The DNA construct may also be prepared by polymerase chain reaction
(PCR) using specific primers, for instance as described in US 4683202 or by
Saiki et al (Science (1988), 239, 487-491). A recent review of PCR methods may
be found in PCR Protocols, (1990), Academic Press, San Diego, California, USA.



The gene sequence encoding the fluorescent protein may be joined
in-frame with a gene encoding the protein of interest and the desired fusion protein
produced when inserted into an appropriate expression vector. For example,
polymerase chain reaction or complementary oligonucleotides may be employed
to engineer a polynucleotide sequence corresponding to the fluorescent protein,
5' or 3' to the gene sequence corresponding to the protein of interest.
Alternatively, the same techniques may be used to engineer a polynucleotide
sequence corresponding to the fluorescent protein sequence 5' or 3' to the
multiple cloning site of an expression vector prior to insertion of a gene sequence
encoding the protein of interest. The polynucleotide sequence corresponding to
the fluorescent protein sequence may comprise additional nucleotide sequences
to include cloning sites, linkers, transcription and translation initiation and/or
termination signals, labelling and purification tags.
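
What joining two coding sequences "in-frame" requires can be sketched in Python as follows. This is a minimal illustration under stated assumptions and not a cloning protocol from the disclosure: the function name, the toy open reading frames and the six-nucleotide linker are all hypothetical. The sketch checks that every part is a whole number of codons, removes the stop codon of the upstream part, and rejects a premature stop before the downstream part.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(upstream_orf, downstream_orf, linker=""):
    """Join two open reading frames so the downstream one stays in frame."""
    for part in (upstream_orf, downstream_orf, linker):
        if len(part) % 3 != 0:
            raise ValueError("each part must be a whole number of codons")
    # Remove the upstream stop codon so that translation reads through.
    if upstream_orf[-3:] in STOP_CODONS:
        upstream_orf = upstream_orf[:-3]
    leading = upstream_orf + linker
    # A stop codon before the downstream part would truncate the fusion.
    for i in range(0, len(leading), 3):
        if leading[i:i + 3] in STOP_CODONS:
            raise ValueError(f"premature stop codon at nucleotide {i + 1}")
    return leading + downstream_orf


# Toy fragments, invented for illustration only.
protein_of_interest = "ATGGCTTAA"      # two codons followed by a stop codon
fluorescent_protein = "ATGAGTAAATAA"   # three codons followed by a stop codon
linker = "GGATCC"                      # hypothetical two-codon linker

if __name__ == "__main__":
    fusion = fuse_in_frame(protein_of_interest, fluorescent_protein, linker)
    print(fusion, len(fusion) % 3 == 0)
```

The same check applies with the fluorescent protein placed 5' of the protein of interest by swapping the first two arguments.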



In a fourth aspect, there is provided an expression vector comprising
suitable expression control sequences operably linked to a nucleic acid molecule
according to the present invention. The DNA construct of the invention may be
inserted into a recombinant vector, which may be any vector that may
conveniently be subjected to recombinant DNA procedures. The choice of vector
will often depend on the host cell into which it is to be introduced. Thus, the
vector may be an autonomously replicating vector, i.e. a vector which exists as an
extrachromosomal entity, the replication of which is independent of chromosomal
replication, e.g. a plasmid. Alternatively, the vector may be one which, when
introduced into a host cell, is integrated into the host cell genome and replicated
together with the chromosome(s) into which it has been integrated.

The vector is preferably an expression vector in which the DNA sequence
encoding a fluorescent protein of the invention is operably linked to additional
segments required for transcription of the DNA. In general, the expression vector
is derived from plasmid or viral DNA, or may contain elements of both. The term
"operably linked" indicates that the segments are arranged so that they function
in concert for their intended purposes, e.g. transcription initiates in a promoter
and proceeds through the DNA sequence coding for the fluorescent protein of the
invention.

The promoter may be any DNA sequence which shows transcriptional
activity in a suitable host cell of choice (e.g. a bacterial cell, a mammalian cell, a
yeast cell, or an insect cell) for expressing a fluorescent protein. The promoter
may be derived from genes encoding proteins either homologous or heterologous
to the host cell.

Examples of suitable promoters for directing the transcription of the DNA
sequence encoding the fluorescent protein of the invention in mammalian cells
are the CMV promoter (US 5168062, US 5385839), Ubiquitin C promoter (Wulff,
M. et al., FEBS Lett., (1990), 261, 101-105), SV40 promoter (Subramani et al.,
Mol. Cell Biol., (1981), 1, 854-864) and MT-1 (metallothionein gene) promoter
(Palmiter et al., Science, (1983), 222, 809-814). An example of a suitable
promoter for use in insect cells is the polyhedrin promoter (US 4745051;
Vasuvedan et al., FEBS Lett., (1992), 311, 7-11). Examples of suitable promoters
for use in yeast host cells include promoters from yeast glycolytic genes
(Hitzeman et al., J. Biol. Chem., (1980), 255, 12073-12080; Alber and Kawasaki,
J. Mol. Appl. Gen., (1982), 1, 419-434) or alcohol dehydrogenase genes (Young
et al., in Genetic Engineering of Microorganisms for Chemicals (Hollaender et al,
eds.), Plenum Press, New York, 1982), or the TPI1 (US 4599311) or ADH2-4c
(Russell et al., Nature, (1983), 304, 652-654) promoters.

Examples of suitable promoters for use in bacterial host cells include the
promoter of the Bacillus stearothermophilus maltogenic amylase gene, the
Bacillus licheniformis alpha-amylase gene, the Bacillus amyloliquefaciens BAN
amylase gene, the Bacillus subtilis alkaline protease gene, or the Bacillus pumilus
xylosidase gene, or the phage Lambda PR or PL promoters or the Escherichia
coli lac, trp or tac promoters.

The DNA sequence encoding the novel fluorescent proteins of the
invention may also, if necessary, be operably connected to a suitable terminator,
such as the human growth hormone terminator (Palmiter et al., op. cit.) or (for
fungal hosts) the TPI1 (Alber and Kawasaki, op. cit.) or ADH3 (McKnight et al.,
op. cit.) terminators. The vector may further comprise elements such as
polyadenylation signals (e.g. from SV40 or the adenovirus 5 E1b region),
transcriptional enhancer sequences (e.g. the SV40 enhancer) and translational
enhancer sequences (e.g. the ones encoding adenovirus VA RNAs).



The vector may further comprise a DNA sequence enabling internal
ribosomal entry and expression of two proteins from one bicistronic transcript
mRNA molecule. An example is the internal ribosomal entry sequence from the
encephalomyocarditis virus (Rees S. et al, BioTechniques (1996), 20, 102-110
and US 4937190).

The recombinant vector may further comprise a DNA sequence enabling
the vector to replicate in the host cell in question. An example of such a
sequence (when the host cell is a mammalian cell) is the SV40 origin of
replication.

When the host cell is a yeast cell, examples of suitable sequences
enabling the vector to replicate are the yeast plasmid 2µ replication genes REP
1-3 and origin of replication.

The vector may also comprise selectable markers, such as a gene that
confers resistance to a drug, e.g. ampicillin, kanamycin, tetracycline,
chloramphenicol, puromycin, neomycin or hygromycin.

The procedures used to ligate the DNA sequences coding for the
fluorescent protein of the invention, the promoter and optionally the terminator
and/or targeting sequence, respectively, and to insert them into suitable vectors
containing the information necessary for replication, are well known to persons
skilled in the art (e.g. Sambrook et al., op. cit.).

In a fifth aspect of the invention, there is provided a host cell transformed 
or transfected with a DNA construct comprising an expression vector according to 
the present invention. 



The DNA construct or the recombinant vector of the invention is suitably
introduced into a host cell which may be any cell which is capable of expressing
the present DNA construct and includes bacteria, yeast and higher eukaryotic
cells (Unger, T.F., The Scientist (1997), 11(17), 20-23; Smith, C., The
Scientist (1998), 12(22), 20; Smith, C., The Scientist (1998), 12(3), 18;
Fernandez, J.M. & Hoeffler, J.P., Gene Expression Systems - using nature for the
art of expression, Academic Press 1999).

Examples of bacterial host cells which, on cultivation, are capable of
expressing the DNA construct of the invention are Gram-positive bacteria, e.g.
species of Bacillus, or Gram-negative bacteria such as E. coli. The transformation
of the bacteria may be effected by using competent cells in a manner known per
se (cf. Sambrook et al., supra).

Examples of suitable mammalian cell lines are the HEK293 and the HeLa
cell lines, primary cells, and the COS (e.g. ATCC CRL 1650), BHK (e.g. ATCC
CRL 1632, ATCC CCL 10), CHL (e.g. ATCC CCL 39) or CHO (e.g. ATCC CCL 61)
cell lines. Methods of transfecting mammalian cells and expressing DNA
sequences introduced in the cells are described in e.g. Kaufman and Sharp, J.
Mol. Biol., (1982), 159, 601-621; Southern and Berg, J. Mol. Appl. Genet., (1982),
1, 327-341; Loyter et al., Proc. Natl. Acad. Sci. USA, (1982), 79, 422-426; Wigler
et al., Cell, (1978), 14, 725; Corsaro and Pearson, Somatic Cell Genetics, (1981),
7, 603; Graham and van der Eb, Virology (1973), 52, 456; and Neumann et al.,
EMBO J., (1982), 1, 841-845.






Examples of suitable yeast cells include cells of Saccharomyces spp. or
Schizosaccharomyces spp., in particular strains of Saccharomyces cerevisiae or
Saccharomyces kluyveri. Methods for transforming yeast cells with heterologous
DNA and producing heterologous polypeptides therefrom are described, e.g. in
US 4599311, US 4931373, US 4870008, US 5037743, and US 4845075, all of
which are hereby incorporated by reference. Transformed cells are selected by a
phenotype determined by a selectable marker, commonly drug resistance or the
ability to grow in the absence of a particular nutrient, e.g. leucine. A preferred
vector for use in yeast is the POT1 vector disclosed in US 4931373. The DNA
sequence encoding the fluorescent protein of the invention may be preceded by a
signal sequence and optionally a leader sequence, e.g. as described above.
Further examples of suitable yeast cells are strains of Kluyveromyces, such as K.
lactis, Hansenula, e.g. H. polymorpha, or Pichia, e.g. P. pastoris (cf. Gleeson et
al., J. Gen. Microbiol., (1986), 132, 3459-3465; US 4882279).






Transformation of insect cells and production of heterologous polypeptides
therein may be performed as described in US 4745051; US 4879236; US
5155037; US 5162222; EP 397485, all of which are incorporated herein by
reference. The insect cell line used as the host may suitably be a Lepidoptera
cell line, such as Spodoptera frugiperda cells or Trichoplusia ni cells (cf. US
5077214). Culture conditions may suitably be as described in, for instance, WO
89/01029 or WO 89/01028, or any of the aforementioned references.






In a sixth aspect, the invention provides a method for preparing a Green
Fluorescent Protein (GFP) or a functional GFP analogue according to the present
invention, the method comprising cultivating a host cell transformed or
transfected with a nucleotide sequence according to the invention and obtaining
therefrom the polypeptide expressed by said nucleotide sequence.

Suitably, the transformed or transfected host cells as described above are
cultured in a suitable nutrient medium under conditions permitting the expression
of a DNA construct according to the invention, after which the cells may be used
in the screening method of the invention. Alternatively, the cells may be
disrupted, after which cell extracts and/or supernatants may be analysed for
fluorescence and/or used to purify the GFP or functional GFP analogue of the
invention.



The medium used to culture the cells may be any conventional medium
suitable for growing the host cells, such as minimal or complex media containing
appropriate supplements. Suitable media are available from commercial
suppliers or may be prepared according to published protocols (e.g. in catalogues
of the American Type Culture Collection; Sambrook et al., supra).

For example, a fusion protein comprising glutathione S-transferase (GST)
and GFP can be constructed and expressed in E. coli. The GFP may be joined
in-frame to the C-terminus of GST in a pGEX plasmid vector (Amersham
Pharmacia Biotech). Recombinant production of the fusion protein is carried out
utilising a standard E. coli expression host, followed by purification employing
glutathione affinity chromatography and removal of the GST tag by proteolytic
cleavage.

In a seventh aspect of the present invention, there is provided a method of
measuring the expression of a protein of interest in a cell. The method
comprises: i) introducing into a cell a nucleic acid molecule comprising a
nucleotide sequence encoding a fluorescent protein which is derived from Green
Fluorescent Protein (GFP) or any functional GFP analogue according to the
present invention, said nucleic acid molecule being operably linked to and under
the control of an expression control sequence which moderates expression of
said protein of interest; ii) culturing the cell under conditions suitable for the
expression of the protein of interest; and iii) detecting the fluorescence emission
of the Green Fluorescent Protein (GFP) or a functional GFP analogue as a
means of measuring the expression of the protein of interest.

In an eighth aspect of the present invention, there is provided a method of
determining the cellular and/or extracellular localisation of a protein of interest
which method comprises:
i) introducing into a cell a nucleic acid molecule comprising a nucleotide
sequence encoding a Green Fluorescent Protein (GFP) or a functional GFP
analogue according to the invention fused to a nucleotide sequence encoding a
protein of interest, said nucleic acid molecule being operably linked to and under
the control of a suitable expression control sequence;
ii) culturing said cell under conditions suitable for the expression of said
protein of interest; and
iii) determining the cellular and/or extracellular localisation of said protein of
interest by detecting the fluorescence emission by optical means.

The fluorescent proteins of the present invention may also be used in a
method to detect and compare the effect of a test substance on the regulation of
expression and/or translocation of two or more different proteins of interest in a
cell. Alternatively, they may be used in a method to compare the expression of a
protein of interest and the simultaneous activity of an expression control
sequence in response to a test substance. The fluorescent proteins may also be
used in a method to compare the activity of two or more expression control
sequences in a cell in response to a test substance. Such methods may be
performed in the presence and in the absence of a test substance whose effect
on the process is to be measured. For example, one detectable reporter
molecule may be used as an internal reference and another as a variable marker,
since regulated expression of a gene can be monitored quantitatively by fusion of
an expression control sequence to a DNA construct encoding, eg.
F64L-S175G-E222G-GFP, measuring the fluorescence, and normalising it to the
fluorescence of a constitutively expressed spectrally distinct fluorescent molecule.
The constitutively expressed spectrally distinct fluorescent molecule, for example
BFP, acts as an internal reference.

Thus, in a ninth aspect of the present invention, there is provided a method
of comparing the effect of one or more test substance(s) on the expression and/or
localisation of one or more different protein(s) of interest in a cell which method
comprises:

i) introducing into a cell:

a) a nucleic acid molecule comprising a nucleotide sequence encoding a
Green Fluorescent Protein (GFP) or a functional GFP analogue according to the
invention optionally fused to a nucleotide sequence encoding a first protein of
interest, said nucleic acid molecule being operably linked to and under the control
of a first expression control sequence; and optionally,

b) at least one different nucleic acid molecule encoding a protein reporter
molecule optionally fused to a different protein of interest, each said nucleic acid
molecule being operably linked to and under the control of a second expression
control sequence wherein said protein reporter molecule has or is capable of
generating an emission signal which is spectrally distinct from that of said Green
Fluorescent Protein (GFP) or a functional GFP analogue;

ii) culturing said cells under conditions suitable for the expression of said
protein(s) of interest in the presence and absence of said test substance(s);

iii) determining the expression and/or localisation of said protein(s) of interest
in said cells by detecting the fluorescence emission by optical means; and

iv) comparing the fluorescence emission obtained in the presence and
absence of said test substance(s) to determine the effect of said test
substance(s) on the expression and/or localisation of said protein(s) of interest.

In a preferred embodiment of the ninth aspect, samples of said cells in a
fluid medium are introduced into separate vessels for each of said test
substances to be studied.

Preferably, the first and second expression control sequences are
different.

Suitably, the protein reporter molecule may be selected from the group
consisting of fluorescent proteins and enzymes. Preferred fluorescent proteins
are those which have a spectrally distinguishable emission wavelength compared
with the emission wavelength of the fluorescent proteins according to the present
invention, for example, BFP. Suitable enzyme reporters are those which are
suitable for generating a detectable (eg. a luminescent or fluorescent) signal in a
substrate. Suitable enzyme/substrates include: luciferase/luciferin;
β-galactosidase/DDAO galactoside; β-galactosidase/fluorescein
di-β-D-galactopyranoside; alkaline phosphatase/Attophos.

In the methods of the invention, the fluorescence of cells transformed or
transfected with the DNA construct according to the invention may suitably be
measured by optical means in, for example: a spectrophotometer, a fluorimeter, a
fluorescence microscope, a cooled charge-coupled device (CCD) imager (such
as a scanning imager or an area imager), a fluorescence activated cell sorter, a
confocal microscope or a scanning confocal device, where the spectral properties
of the cells in culture may be determined as scans of light excitation and
emission.



The fluorescent proteins of the present invention have many additional 
applications, for example: 

i) Use as a non-toxic marker for selection of transfected cells containing an
expression vector encoding at least the fluorescent protein of the invention. The 
fluorescent emission may be used to isolate transfected cells thereby overcoming 
the need for selection with toxic molecules such as antibiotics. 

ii) Use as a protein label in living and fixed cells. The novel proteins exhibit
strong fluorescence and are a suitable label for proteins present at low
concentrations. Since no substrate is needed and visualization of the fluorescent
protein does not damage the cells, dynamic analysis can be performed.

iii) Use as marker in cell or organelle fusion. By labelling one or more cells or
organelles with the novel proteins, for example, F64L-S175G-E222G-GFP, and
other cells or organelles with the same or another fluor, fusions such as heterokaryon
formation can be monitored.

iv) Translocation of proteins fused to the novel proteins of the invention can
be visualised. The translocation of intracellular proteins to a specific organelle
can be visualised by fusing the protein of interest to a fluorescent protein, for
example F64L-S175G-E222G-GFP, and labelling the organelle with another
fluorescent molecule, eg. fluorescent protein. Translocation can then be detected
as a spectral shift in the fluorescent proteins in the specific organelle.

v) Use as a secretion marker. By fusion of a fluorescent protein of the
invention to a signal peptide or a peptide to be secreted, secretion may be
followed in living cells.

vi) Use as genetic reporter or protein tag in transgenic animals. Due to the
strong fluorescence of the novel proteins, they are suitable as tags for proteins
and gene expression, since the signal to noise ratio is significantly improved over
the prior art proteins, such as wild-type GFP.

vii) Use as a cell or organelle integrity marker. By expressing the novel
proteins targeted to an organelle, it is possible to calculate the leakage of the
protein and use that as a measure of cell integrity.

viii) Use as a transfection marker, and as a marker to be used in combination
with FACS sorting (eg. as described in Example 3). Due to the increased
brightness of the novel proteins the quality of cell detection and sorting can be
significantly improved.

ix) Use as real-time probe working at near physiological concentrations.
Since the novel proteins of the present invention are significantly brighter than
wtGFP when expressed in cells at about 37 °C and excited with light at about
490 nm, the concentration needed for visualization can be lowered. Target sites
for enzymes engineered into the novel proteins, for example F64L-S175G-
E222G-GFP, can therefore be present in the cell at low concentrations in living
cells. This is important for two reasons: i) the probe must interfere as little as
possible with the intracellular process being studied, and ii) the translational and
transcriptional apparatus should be stressed minimally.




x) Transposon vector mutagenesis can be performed using the novel
proteins as markers in transcriptional and translational fusions. Transposons
may be used in microorganisms encoding the novel proteins. The transposons
may be constructed for translational and transcriptional fusion to be used for
screening for promoters. Transposon vectors encoding the novel proteins, for
example F64L-S175G-E222G-GFP, can be used for tagging plasmids and
chromosomes.


xi) Use as a reporter for bacterial detection by introducing the novel proteins
into the genome of bacteriophages. By engineering the novel proteins, for
example F64L-S175G-E222G-GFP, into the genome of a phage a diagnostic tool
can be designed. F64L-S175G-E222G-GFP will be expressed only upon
transfection of the genome into a living host. The host specificity is defined by
the bacteriophage.




The invention is further illustrated by reference to the following examples
and figures in which:

Figure 1 is the nucleotide sequence of wtGFP (Chalfie et al, Science, (1994),
263, 802-5) and referred to herein as SEQ ID No.1.

Figure 2 is the corresponding amino acid sequence of wtGFP (Chalfie et al,
Science, (1994), 263, 802-5) and referred to herein as SEQ ID No.2.

Figure 3 is the predicted amino acid sequence of F64L-V163A-E222G-GFP and
referred to herein as SEQ ID No.3.

Figure 4 is the predicted amino acid sequence of F64L-S175G-E222G-GFP and
referred to herein as SEQ ID No.4.

Figure 5 is the predicted amino acid sequence of F64L-S65T-S175G-GFP and
referred to herein as SEQ ID No.5.

Figure 6 is the predicted amino acid sequence of F64L-S65T-V163A-GFP and
referred to herein as SEQ ID No.6.

Figure 7 is a plot showing average fluorescence intensities of mutant GFPs
according to the invention.




EXAMPLES

1. Cloning of GFP gene and template vector construction

The GFP gene used in the present study was contained within the plasmid
pGFP (Chalfie et al., Science, (1994), 263, 802-805; GenBank accession number
U17997) obtained from Clontech Laboratories Inc. (Palo Alto, Ca, USA). The
gene was amplified by PCR using Pfu polymerase (Promega, Madison, WI, USA)
according to recognised protocols (Saiki et al., Science, (1988), 239, 487-491).
The sequences of primers used were:

GFP-1  5'-ggtacgggccgccaccatgagtaaaggagaagaacttttcac         SEQ ID No.7
GFP-2  5'-ggtacgggttaaccggttttgtatagttcatccatg               SEQ ID No.8
GFP-3  5'-ggtacgggccgccacca^iggatccaaaggagaagaacttttcac      SEQ ID No.9

Primer GFP-1 exhibits homology to the 5' region of the GFP gene and
contains a partial Kozak site (Kozak, M, Cell, (1986), 44, 283) prior to the start
codon for efficient initiation of translation in mammalian systems. Primer GFP-2
exhibits homology to the 3' region of the GFP gene and contains an additional
AgeI restriction enzyme site immediately prior to the stop codon to facilitate
cloning of proteins by fusion to the C-terminus of GFP. Primer GFP-3 is similar to
primer GFP-1, exhibiting homology to the 5' region of the GFP gene, but contains
an additional restriction site (BamHI) immediately after the initiation codon to
facilitate cloning of proteins by fusion to the N-terminus of GFP. Amplified
products resulting from PCR reactions containing primers GFP-1 and GFP-2, and
GFP-3 and GFP-2, were tailed with a single 3'-deoxyadenosine using Taq
polymerase (Amersham Pharmacia Biotech, Amersham, UK) and ligated into the
TA cloning vector pTARGET (Promega) according to manufacturer's instructions.
The correct orientation relative to the CMV promoter and sequence of the insert
was determined by automated DNA sequencing.
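
As an illustrative check only, the primer features described above (the partial Kozak site ahead of the start codon in GFP-1, and the AgeI site immediately before the stop codon in GFP-2) can be located in the primer sequences as listed. The helper name and the exact Kozak motif searched for are assumptions made for this sketch; the AgeI recognition sequence is ACCGGT.

```python
def reverse_complement(seq):
    """Return the reverse complement of a lower-case DNA sequence."""
    pairs = {"a": "t", "t": "a", "g": "c", "c": "g"}
    return "".join(pairs[base] for base in reversed(seq))


GFP_1 = "ggtacgggccgccaccatgagtaaaggagaagaacttttcac"  # SEQ ID No.7
GFP_2 = "ggtacgggttaaccggttttgtatagttcatccatg"        # SEQ ID No.8

# Assumption: the partial Kozak site is taken to be "gccgccacc" followed by atg.
KOZAK_WITH_START = "gccgccaccatg"
AGEI_SITE = "accggt"

# GFP-1 is a sense primer, so the motif is searched for directly.
kozak_pos = GFP_1.find(KOZAK_WITH_START)

# GFP-2 is an antisense primer; convert it to the sense strand first.
gfp2_sense = reverse_complement(GFP_2)
agei_pos = gfp2_sense.find(AGEI_SITE)
stop_pos = gfp2_sense.find("taa", agei_pos)

print("Kozak motif starts at index", kozak_pos)
print("GFP-2 sense strand:", gfp2_sense)
print("AgeI site at", agei_pos, "and stop codon at", stop_pos)
```

On the sense strand of GFP-2 the stop codon begins directly after the six bases of the AgeI site, which is consistent with the statement that the site lies immediately prior to the stop codon.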

2. Generation of mutants of GFP

The following mutants of GFP were generated in the present study: F64L-
GFP, V163A-GFP, S175G-GFP, E222G-GFP, F64L-E222G-GFP, F64L-V163A-
GFP, F64L-S175G-GFP, V163A-S175G-GFP, V163A-E222G-GFP, S175G-
E222G-GFP, F64L-S175G-E222G-GFP, V163A-S175G-E222G-GFP, F64L-
V163A-E222G-GFP, F64L-S65T-S175G-GFP, F64L-S65T-V163A-GFP. Mutants
of the GFP gene (SEQ ID 3) construct within pTARGET (see Example 1) were
generated using the QuikChange™ site-directed mutagenesis kit (Stratagene, La
Jolla, Ca, USA) according to manufacturer's instructions. The sequences of
primers used to generate F64L, S65T, V163A, S175G and E222G single mutants
have been documented in Table 1. Multiply-mutated GFP molecules were
generated through successive mutagenesis reactions. All GFP mutant
sequences were verified by automated sequencing.
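
The relationship between a variant name and its predicted amino acid sequence can be illustrated with a short sketch, which applies the named substitutions one after another to the wild type sequence of SEQ ID No.2, much as multiply-mutated molecules were built up by successive reactions. The function names are hypothetical, and the one-letter sequence below is a transcription of Figure 2 made for this example.

```python
import re

# Wild type GFP (SEQ ID No.2) in one-letter code, 16 residues per row.
WT_GFP = (
    "MSKGEELFTGVVPILV" "ELDGDVNGHKFSVSGE" "GEGDATYGKLTLKFIC"
    "TTGKLPVPWPTLVTTF" "SYGVQCFSRYPDHMKR" "HDFFKSAMPEGYVQER"
    "TIFFKDDGNYKTRAEV" "KFEGDTLVNRIELKGI" "DFKEDGNILGHKLEYN"
    "YNSHNVYIMADKQKNG" "IKVNFKIRHNIEDGSV" "QLADHYQQNTPIGDGP"
    "VLLPDNHYLSTQSALS" "KDPNEKRDHMVLLEFV" "TAAGITHGMDELYK"
)


def apply_substitution(protein, mutation):
    """Apply one substitution written like 'F64L' (1-based position)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"unrecognised mutation: {mutation}")
    old, position, new = match.group(1), int(match.group(2)), match.group(3)
    if protein[position - 1] != old:
        raise ValueError(f"{mutation}: residue {position} is {protein[position - 1]}")
    return protein[:position - 1] + new + protein[position:]


def build_variant(protein, name):
    """Apply every substitution in a name such as 'F64L-S175G-E222G-GFP'."""
    for part in name.split("-"):
        if part != "GFP":
            protein = apply_substitution(protein, part)
    return protein


variant = build_variant(WT_GFP, "F64L-S175G-E222G-GFP")
changed = [i + 1 for i, (a, b) in enumerate(zip(WT_GFP, variant)) if a != b]
print(changed)
```

Checking the wild type residue before each substitution guards against numbering slips, for example a name that is off by one position.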



Table 1

Primer    Mutation  Sequence (5' - 3')                             SEQ ID No.
GFP-64f   F64L      ccaacacttgtcactactctctcttatggtgttcaat          10
GFP-64r   F64L      attgaacaccataagagagagtagtgacaagtgttgg          11
GFP-65f   S65T      ccaacacttgtcactactctcacctatggtgttcaatgcttttca  12
GFP-65r   S65T      tgaaaagcattgaacaccataggtgagagtagtgacaagtgttgg  13
GFP-163f  V163A     gacaaacaaaagaatggaatcaaagccaacttcaaaattagacac  14
GFP-163r  V163A     gtgtctaattttgaagttggctttgattccattcttttgtttgtc  15
GFP-175f  S175G     caacattgaagatggaggcgttcaactagcagacc            16
GFP-175r  S175G     ggtctgctagttgaacgcctccatcttcaatgttg            17
GFP-222f  E222G     ccacatggtccttcttggctttgtaacagctgctgg           18
GFP-222r  E222G     ccagcagctgttacaaagccaagaaggaccatgtgg           19
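
The forward and reverse mutagenic primers of each pair are expected to be reverse complements of one another. A minimal sketch of this check is given below for the S175G pair; the helper name is an assumption of this example, and the wild type segment is transcribed from the corresponding region of SEQ ID No.1 for comparison.

```python
def reverse_complement(seq):
    """Return the reverse complement of a lower-case DNA sequence."""
    pairs = {"a": "t", "t": "a", "g": "c", "c": "g"}
    return "".join(pairs[base] for base in reversed(seq))


GFP_175F = "caacattgaagatggaggcgttcaactagcagacc"  # SEQ ID No.16
GFP_175R = "ggtctgctagttgaacgcctccatcttcaatgttg"  # SEQ ID No.17

# Corresponding wild type segment (codon 175 is agc, encoding Ser).
WT_SEGMENT = "caacattgaagatggaagcgttcaactagcagacc"

pair_is_complementary = reverse_complement(GFP_175F) == GFP_175R

# Positions at which the forward primer differs from the wild type segment.
mismatches = [i for i, (p, w) in enumerate(zip(GFP_175F, WT_SEGMENT)) if p != w]

print(pair_is_complementary, mismatches)
```

A single mismatch converts the codon agc (Ser) to ggc (Gly), which corresponds to the S175G substitution.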
3. Influence of individual mutations and combinations of F64L, S65T, V163A,
S175G and E222G mutations upon GFP when expressed in mammalian
cells

Plasmid DNA to be used for transfection was prepared for all GFP and
EGFP constructs using the HiSpeed plasmid purification kit (Qiagen, Westberg,
NL). DNA was diluted to 100 ng µl-1 in 18-Megohm water (Sigma) and 1 µg
used for transfections. For 50-80% confluency on the day of transfection, HeLa
cells were plated at a density of 5x10^4/well in 6-well plates and incubated
overnight. A 1:3 (1 µg : 3 µl) ratio of DNA to FuGene6 reagent (Roche) was used
for each transient transfection reaction: 3 µl FuGene6 was added to 87 µl
serum-free DMEM medium (Sigma) (containing penicillin/streptomycin, L-glutamine
(GibcoBRL)) and gently tapped to mix, then 10 µl (1 µg) construct DNA was added
and again gently mixed. The FuGene6:DNA complex was incubated at room
temperature for 40 minutes then added dropwise directly to the cells without
changing the medium, and the plates swirled for even distribution.
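
The quantities in the transfection mix can be cross-checked with simple arithmetic. The following sketch uses only the volumes and the stock concentration stated above; the variable names are chosen for this example.

```python
DNA_STOCK_NG_PER_UL = 100   # diluted plasmid DNA stock
DNA_VOLUME_UL = 10          # construct DNA added per reaction
REAGENT_VOLUME_UL = 3       # FuGene6 per reaction
MEDIUM_VOLUME_UL = 87       # serum-free DMEM per reaction

# Mass of DNA per reaction, converted from ng to micrograms.
dna_mass_ug = DNA_STOCK_NG_PER_UL * DNA_VOLUME_UL / 1000

# Total volume of the complex added to each well.
total_volume_ul = DNA_VOLUME_UL + REAGENT_VOLUME_UL + MEDIUM_VOLUME_UL

# Reagent volume per microgram of DNA, i.e. the 1:3 ratio.
reagent_ul_per_ug = REAGENT_VOLUME_UL / dna_mass_ug

print(dna_mass_ug, total_volume_ul, reagent_ul_per_ug)
```

Ten microlitres of a 100 ng per microlitre stock gives 1 microgram of DNA in a 100 microlitre mix, matching the stated 1:3 ratio of DNA to reagent.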

Fluorescence measurements were made 24 or 48 hours after transfection.
Briefly, the cells were washed in phosphate-buffered saline, released with the
addition of 2 drops of Trypsin (GibcoBRL) and resuspended in 1 ml of complete
DMEM medium (containing penicillin/streptomycin, L-glutamine and foetal bovine
serum (Sigma)). The cells were vortexed and analysed on a FACSCalibur flow
cytometer (Becton Dickinson & Co., NJ, USA) for characterisation of whole cell
fluorescence, with excitation at 488 nm and emission viewed with fluorescence
filter set 530/30 nm (range 515-545 nm). 10,000 events were collected for each
transfection and 6-10 replicates carried out for each construct. Average
fluorescent intensities from the FACS analysis were obtained as geometric
means (mean fluorescence on log scale) and are shown in Figure 7.
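
For clarity, a geometric mean is the exponential of the arithmetic mean of the logarithms of the individual intensities, which is why it corresponds to the mean on a log scale. The sketch below is a generic illustration; the function name and the intensity values are hypothetical and are not data from the present study.

```python
import math


def geometric_mean(intensities):
    """Geometric mean of strictly positive fluorescence intensities."""
    if not intensities or any(value <= 0 for value in intensities):
        raise ValueError("intensities must be a non-empty list of positive values")
    log_sum = sum(math.log(value) for value in intensities)
    return math.exp(log_sum / len(intensities))


# Hypothetical per-event intensities spanning two decades.
events = [1.0, 10.0, 100.0]
print(geometric_mean(events))
```

Compared with an arithmetic mean, the geometric mean is less dominated by a small number of very bright events, which suits flow cytometry data displayed on a logarithmic axis.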



Claims

1. A fluorescent protein which is derived from Green Fluorescent Protein
(GFP) or any functional GFP analogue and has an amino acid sequence which is
modified by amino acid substitution compared with the amino acid sequence of
wild type Green Fluorescent Protein, said modified fluorescent protein comprising:

i) an amino acid substitution at position F64;

ii) a single amino acid substitution at a position selected from the group
consisting of positions S65 and E222; and

iii) a single amino acid substitution at a position selected from the group
consisting of positions V163 and S175;

wherein said modified GFP has a different excitation spectrum and/or
emission spectrum compared with wild type GFP.

2. A fluorescent protein according to claim 1 wherein the amino acid F at
position 64 has been substituted by an amino acid selected from the group
consisting of L, I, V, A and G.

3. A fluorescent protein according to claims 1 or 2 wherein the amino acid S
at position 65 has been substituted by an amino acid selected from the group
consisting of G, A, L, C, V, I and T.

4. A fluorescent protein according to claims 1 or 2 wherein the amino acid E
at position 222 has been substituted by an amino acid selected from the group
consisting of G, A, V, L, I, F, S, T, N and Q.

5. A fluorescent protein according to any of claims 1 to 4 wherein said V163
mutation is V163A.

6. A fluorescent protein according to any of claims 1 to 4 wherein said S175
mutation is S175G.

7. A fluorescent protein according to any of claims 1 to 6 selected from the
group consisting of: F64L-V163A-E222G-GFP, F64L-S175G-E222G-GFP, F64L-
S65T-S175G-GFP and F64L-S65T-V163A-GFP.
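
Purely as an illustration, the positional pattern recited in claim 1 can be expressed as a simple test on a variant name. The sketch assumes one reading of the claim, in which "a single amino acid substitution" means exactly one of the two listed positions is substituted; it checks positions only and says nothing about the spectral requirement. The function names are hypothetical.

```python
import re


def substituted_positions(name):
    """Return wild type residue labels (eg. 'F64') substituted in a variant name."""
    return {match.group(1) for match in re.finditer(r"([A-Z]\d+)[A-Z]", name)}


def matches_claim_1_positions(name):
    """True if the name has F64, one of S65/E222 and one of V163/S175 substituted."""
    positions = substituted_positions(name)
    return (
        "F64" in positions
        and len(positions & {"S65", "E222"}) == 1
        and len(positions & {"V163", "S175"}) == 1
    )


claimed = [
    "F64L-V163A-E222G-GFP",
    "F64L-S175G-E222G-GFP",
    "F64L-S65T-S175G-GFP",
    "F64L-S65T-V163A-GFP",
]
for name in claimed + ["F64L-E222G-GFP", "V163A-S175G-E222G-GFP"]:
    print(name, matches_claim_1_positions(name))
```

Under this reading the four variants of claim 7 match the pattern, whereas double mutants such as F64L-E222G-GFP, or triple mutants lacking a substitution at F64, do not.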

8. A fluorescent protein according to any of claims 1 to 7 having an amino
acid sequence which is modified by amino acid substitution compared with the
amino acid sequence of wild type Green Fluorescent Protein having the
sequence: SEQ ID No.2.

9. A fluorescent protein derived from Green Fluorescent Protein (GFP) and
having the amino acid sequence as set forth in SEQ ID No.3.

10. A fluorescent protein derived from Green Fluorescent Protein (GFP) and
having the amino acid sequence as set forth in SEQ ID No.4.

11. A fluorescent protein derived from Green Fluorescent Protein (GFP) and
having the amino acid sequence as set forth in SEQ ID No.5.

12. A fluorescent protein derived from Green Fluorescent Protein (GFP) and
having the amino acid sequence as set forth in SEQ ID No.6.

13. A fusion compound comprising a protein of interest fused to a fluorescent
protein, said fluorescent protein being a modified protein according to any of
claims 1 to 12.

14. A nucleic acid molecule comprising a nucleotide sequence encoding a
fluorescent protein which is derived from Green Fluorescent Protein (GFP) or any
functional GFP analogue and has an amino acid sequence which is modified by
amino acid substitution compared with the amino acid sequence of wild type
Green Fluorescent Protein, said modified fluorescent protein comprising:

i) an amino acid substitution at position F64;

ii) a single amino acid substitution at a position selected from the group
consisting of positions S65 and E222; and

iii) a single amino acid substitution at a position selected from the group
consisting of positions V163 and S175;

wherein said modified GFP has a different excitation spectrum and/or emission
spectrum compared with wild type GFP.

15. A nucleic acid according to claim 14 encoding a fluorescent protein having
an amino acid sequence which is modified by amino acid substitution compared
with the amino acid sequence of wild type Green Fluorescent Protein having the
sequence: SEQ ID No.2.

16. The nucleic acid molecule according to claims 14 or 15 encoding a
fluorescent protein having an amino acid sequence selected from the group
consisting of SEQ ID No.3, SEQ ID No.4, SEQ ID No.5 and SEQ ID No.6.

17. A nucleic acid molecule comprising a nucleotide sequence encoding a
fusion protein comprising a protein of interest fused to a fluorescent protein
according to any one of claims 1 to 12.

18. An expression vector comprising suitable expression control sequences
operably linked to a nucleic acid molecule according to any of claims 14 to 17.

19. A host cell transformed or transfected with a DNA construct comprising an
expression vector according to claim 18.

20. The host cell according to claim 19 wherein said host cell is selected from
the group consisting of a mammalian cell, a bacterial cell, a yeast cell and an
insect cell.

21. A method for preparing a Green Fluorescent Protein (GFP) or a functional
GFP analogue according to the present invention, said method comprising
cultivating a host according to claim 20 and obtaining therefrom the polypeptide
expressed by said nucleotide sequence.

22. A method of measuring the expression of a protein of interest in a cell
which method comprises:

i) introducing into a cell a nucleic acid molecule comprising a nucleotide
sequence encoding a fluorescent protein which is derived from Green
Fluorescent Protein (GFP) or any functional GFP analogue according to any one
of claims 1 to 12, said nucleic acid molecule being operably linked to and under
the control of an expression control sequence which moderates expression of
said protein of interest;

ii) culturing said cell under conditions suitable for the expression of said
protein of interest; and

iii) detecting the fluorescence emission of said Green Fluorescent Protein
(GFP) or a functional GFP analogue as a means of measuring the expression of
said protein of interest.


23. A method of determining the cellular and/or extracellular localisation of a
protein of interest which method comprises:

i) introducing into a cell a nucleic acid molecule comprising a nucleotide
sequence encoding a fluorescent protein which is derived from Green
Fluorescent Protein (GFP) or any functional GFP analogue according to any one
of claims 1 to 12 fused to a nucleotide sequence encoding a protein of interest,
said nucleic acid molecule being operably linked to and under the control of a
suitable expression control sequence;

ii) culturing said cell under conditions suitable for the expression of said
protein of interest; and

iii) determining the cellular and/or extracellular localisation of said protein of
interest by detecting the fluorescence emission by optical means.

24. A method of comparing the effect of one or more test substance(s) on the
expression and/or localisation of one or more different protein(s) of interest in a
cell which method comprises:

i) introducing into a cell:

a) a nucleic acid molecule comprising a nucleotide sequence encoding a
Green Fluorescent Protein (GFP) or a functional GFP analogue according to any
one of claims 1 to 12 optionally fused to a nucleotide sequence encoding a first
protein of interest, said nucleic acid molecule being operably linked to and under
the control of a first expression control sequence; and optionally,

b) at least one different nucleic acid molecule encoding a protein reporter
molecule optionally fused to a different protein of interest, each said nucleic acid
molecule being operably linked to and under the control of a second expression
control sequence wherein said protein reporter molecule has or is capable of
generating an emission signal which is spectrally distinct from that of said Green
Fluorescent Protein (GFP) or functional GFP analogue;

ii) culturing said cells under conditions suitable for the expression of said
protein(s) of interest in the presence and absence of said test substance(s);

iii) determining the expression and/or localisation of said protein(s) of interest
in said cells by detecting the fluorescence emission by optical means; and

iv) comparing the fluorescence emission obtained in the presence and
absence of said test substance(s) to determine the effect of said test
substance(s) on the expression and/or localisation of said protein(s) of interest.

25. The method according to claim 24 wherein samples of said cells in a fluid
medium are introduced into separate vessels for each of said test substances to
be studied.
Figure 1/7

Nucleotide Sequence of wtGFP (Chalfie et al, Science, (1994), 263, 802-5):

SEQ ID No.1

atg agt aaa gga gaa gaa ctt ttc act gga gtt gtc cca att ctt gtt  48
gaa tta gat ggt gat gtt aat ggg cac aaa ttt tct gtc agt gga gag  96
ggt gaa ggt gat gca aca tac gga aaa ctt acc ctt aaa ttt att tgc  144
act act gga aaa cta cct gtt cca tgg cca aca ctt gtc act act ttc  192
tct tat ggt gtt caa tgc ttt tca aga tac cca gat cat atg aaa cgg  240
cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa aga  288
act ata ttt ttc aaa gat gac ggg aac tac aag aca cgt gct gaa gtc  336
aag ttt gaa ggt gat acc ctt gtt aat aga atc gag tta aaa ggt att  384
gat ttt aaa gaa gat gga aac att ctt gga cac aaa ttg gaa tac aac  432
tat aac tca cac aat gta tac atc atg gca gac aaa caa aag aat gga  480
atc aaa gtt aac ttc aaa att aga cac aac att gaa gat gga agc gtt  528
caa cta gca gac cac tat caa caa aat act cca att ggc gat ggc cct  576
gtc ctt tta cca gac aac cat tac ctg tcc aca caa tct gcc ctt tcg  624
aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt gta  672
aca gct gct ggg att aca cat ggc atg gac gaa cta tac aaa tag  717
Figure 2/7

Amino Acid Sequence of wtGFP (Chalfie et al, Science, (1994), 263, 802-5):

SEQ ID No.2

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1               5                   10                  15

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe
    50                  55                  60

Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
65                  70                  75                  80

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val
                165                 170                 175

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
        195                 200                 205

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
    210                 215                 220

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
225                 230                 235
Figure 3/7

Predicted Amino Acid Sequence of F64L-V163A-E222G-GFP:

SEQ ID No.3

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1               5                   10                  15

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
    50                  55                  60

Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
65                  70                  75                  80

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160

Ile Lys Ala Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val
                165                 170                 175

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
        195                 200                 205

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Gly Phe Val
    210                 215                 220

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
225                 230                 235
Figure 4/7

Predicted Amino Acid Sequence of F64L-S175G-E222G-GFP:

SEQ ID No.4

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1               5                   10                  15

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
    50                  55                  60

Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
65                  70                  75                  80

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Gly Val
                165                 170                 175

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
        195                 200                 205

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Gly Phe Val
    210                 215                 220

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
225                 230                 235
Figure 5/7

Predicted Amino Acid Sequence of F64L-S65T-S175G-GFP:

SEQ ID No.5

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1               5                   10                  15

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
    50                  55                  60

Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
65                  70                  75                  80

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Gly Val
                165                 170                 175

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
        195                 200                 205

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
    210                 215                 220

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
225                 230                 235
Figure 6/7

Predicted Amino Acid Sequence of F64L-S65T-V163A-GFP:

SEQ ID No.6

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1               5                   10                  15

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
    50                  55                  60

Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
65                  70                  75                  80

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160

Ile Lys Ala Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val
                165                 170                 175

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
        195                 200                 205

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
    210                 215                 220

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
225                 230                 235
Figure 7/7

Flow cytometry of GFP mutations

[Bar chart: fluorescence (geometric mean) for each construct. The bar values are not recoverable from the scanned image. Constructs as listed on the chart, top to bottom:]

F64L:S175G:E222G
F64L:V163A:E222G
F64L:S65T:V163A
F64L:S65T:S175G
F64L:E222G
V163A:S175G
F64L:V163A
F64L
V163A:S175G:E222G
V163A
F64L:S175G
S175G
V163A:E222G
S175G:E222G
E222G
wtGFP






